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Abstract 

Tetracyclines are an old class of molecules that constitute a broad-spectrum antibiotics. Since the first member 
of tetracycline family were isolated, the clinical importance of these compounds as therapeutic and prophylactic 
agents against a wide range of infections has stimulated efforts to define their mode of action as inhibitors of 
bacterial reproduction. We used three SAR methodologies for the analysis of biological activity of a set of 104 
tetracycline compounds. Our calculation were carried out using the semi-empirical Austin Method One (AMI) 
and Parametric Method 3 (PM3). Electronic Indices Methodology (EIM), Principal Component Analysis (PGA) 
and Artificial Neural Networks (ANN) were applied to the classification of 14 old and 90 new proposed derivatives 
of tetracyclines. Our results make evident the importance of EIM descriptors in pattern recognition and also 
show that the EIM can be effectively used to predict the biological activity of Tetracyclines. 

1 Introduction 

Tetracyclines (TC) are a set of compounds used in human therapy against a broad-spectrum of microbial agents 
since 1947. They can be classified as low cost drugs with small side-efFects being largely used by food industry and 
for therapy in animals and plants [1]. TC present activity against Gram-positive and Gram-negative anaerobic and 
aerobic bacteria, mycrobacterium and several protozoan parasites [2] . Non-antibacterial effects as anti-inflammatory, 
immunosuppressive [3], antioxidant [4] and anticancer [5, 6] are also attributed to them. 

The excessive use of TC during years helped resistant bacteria proliferation, limiting then their clinical use. More 
than 30 TC resistance determinants have been described [7, 8] and several different TC resistance determinants have 
been identified in E. coli [9, 10, 11]. This emergence of resistant pathogens is becoming an increasingly important 
problem and it motivates the search for new TC variations and derivatives [12]. TC can be divided into two classes, 
the bacteriostatic typical TC and atypical TC, which are bactericidal and they exert their effects by promoting 
bacterial autolysis. Most probably due to their different modes of action, atypical TC have been shown to exert 
activity against some TC resistant bacteria [13, 14]. 

The chemistry of TC in solution is quite complicated [15] due to their ability to adopt different conformations, 
protonation states, and tautomeric forms, depending on the conditions investigated [16]. TC and its analogues 
undergo complex formation with a variety of the metal cations present in biological fluids [17, 18, 19, 20] and this 
affects their chemical as well as conformational equilibrium in solutions. Theoretical and experimental investigations 
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Table 1: Set of investigated Tetracyclines labelled accordingly to Figure 1 and respective biological activity index 
[27]. For compound 12 the substituintes on carbon 11 and 12 are interchanged and for compound 13 carbons 3 and 
4 are linked to hydrogen atoms. 



Molecule 


Compound 


R5 


Rg 




R7 


Rg 




01 


Tetracycline 


H 


OH 


Me 


H 


H 


600.0 


02 


7-NH2-6-Demethyl-6-deoxytetracycline 


H 


H 


H 


NH2 


H 


190.0 


03 


5-Hydroxytetracycline 


OH 


OH 


Me 


H 


H 


650.0 


04 


9-N02-6-Demethyl-6-deoxytetracycline 


H 


H 


H 


H 


NO2 


145.0 


05 


9-NH2-6-Demethyl-6-deoxytetracycline 


H 


H 


H 


H 


NH2 


350.0 


06 


7-Cl-6-Demethyltetracycline 


H 


OH 


H 


CI 


H 


1100.0 


07 


7-Cl- Tetracycline 


H 


OH 


Mc 


CI 


H 


950.0 


08 


7-N02-6-Demethyl-6-deoxytetracycline 


H 


H 


H 


NO2 


H 


1450.0 


09 


9-N(CH3)2-6-Demethyldeoxytetracycline 


H 


H 


H 


H 


N(Me)2 


76.0 


10 


6-Demethyl-6-deoxytetracycline 


H 


H 


H 


H 


H 


250.0 


11 


6-Demethylamino-6-demethyldeoxytetracycline 


H 


H 


H 


N(Me)2 


H 


250.0 


12 


5a,6-Anhydrotetracycline 


H 


Me 




H 


H 


130.0 


13 


1 2a-Deoxytetracycline 


H 


OH 


Me 


H 


H 


8.5 


14 


7-Br-6-Demethyl-6-deoxytetracycline 


H 


H 


H 


Br 


H 


70.0 



have contributed to the understanding of the interaction of TC with biological medium and their mode of action 
[21, 22, 23, 24, 25, 26]. Theoretical calculations have also been applied to investigations of new applications for old, 
new and hypothetical TC derivatives [22]. 

In this work we present an investigation of electronic and quantum features for a series of TC (Figure 1, Table 1) 
for which the biological activity against E. Coli is known and also to new compounds [27]. Our results showed that 
it is possible to directly correlate some quantum electronic descriptors to TC biological activity. This information 
is then used in the design of new compounds. 



2 Methodology 

We started our investigation with a detailed search for the global minimum of energy for 14 TC derivatives with 
known biological activity (Figure 1, Table 1). In the geometrical search of each molecule all degrees of freedom 
have been allowed to change during optimization and special atention has been devoted to the NH(Me)2 group 
bounded to carbon 4, which is known to influence TC solution and gas phase stability [16]. All the calculations 
have been performed with semiempirical Austim Method 1 (AMI) [28] and Parametric Method 3 (PM3) [29, 30]. 
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The AMI and PM3 methods are widely used in the investigation of organic molecules with very good results [31]. 

In this study one of the points we would like to evaluate is whctcr the geometry and the correlation of theoretical 
properties with biological activity are dependent on the used semiempirical method. Molecules are studied in gas 
phase adopting the zwiterionic form, that is one of three TC basic forms and considered to be the active species 
in the biological medium. In zwiterionic form the total charge in molecule is equal zero hut the groups bounded 
to the carbons 3 and 4 are charged and set to 0~ and NH~^{Me)2, respectively [27]. Although we can expect 
that Hartree-Fock methods such as AMI and PM3 could have some difficulties in precisely discribing these forms, 
a systematic error is expected. Zwiterionic together with NH~ ionized species are commonly present in aqueous 
solutions and it is believed to complex with ions metals as Ca^+, Mg^+, Fe^+, Zn-^+, Al'^+ in biological medium, 
affecting the bioavailability and therapeutic TC properties [32, 33, 34]. 

For the geometry optimization and properties calculation we have used the Chem2Pac package [35]. Once 
optimized, the conformations of minimun energy were the basis for theoretical properties and electronic descriptors 
calculations. We based our structure-activity relationship (SAR) investigation on the Electronic Indices Methodology 
(EIM) and its descriptors. EIM was initially developed to identify the carcinogenic activity of polycyclic aromatic 
hydrocarbons [37, 36] using very simple rules. The methodology uses only two descriptors called A and rj. The A 
is defined as the energy separation of molecular levels as indicated in Eq.(l). 



The descriptor t/, is based on the Local Density of States (LDOS), which is obtained as the Density of States 
(DOS) for a selected group of atomic orbitals or atoms. The r] is calculated as the LDOS difference between the 
most relevant molecular electronic levels (defined in the A calculation) over molecular regions identified as related 
to biological activity: 



Ca is the cocficient of atomic orbital from LCAO (Linear Combination of Atomic Orbitals) approximation, ai 
and ttn refer to atoms composing the selected molecular region. A contains global molecular information (involving 
the eigenvalues) while r] contains local molecular information (LDOS) assumed to reflect the molecular environmental 
biochemical mechanisms (for details of EIM see refs. [37, 36, 38, 39, 35, 40, 41]). 

With only these two descriptors EIM has classified with sucess, as active and inactive, molecules of differente 
classes of activity including steroid hormones [38, 39], integrase inhibitors [35], antibiotics [35], and tumoral [40] 
and antitumoral compounds [41]. Usually the EIM rules involve critical values assuming the boolean form: IF 
A > (<)Ac AND (OR) T] > {<)r]c, the molecule will be active, otherwise it will be inactive. Ac and rjc are 
critical values determined from EIM analysis. The critical values and EIM rules are automatically obtained from an 
exploratory scanning. In contrast to other SAR methods, EIM does not use training sets. 

In order to obtain the EIM parameters and classificatory rules, the common molecular skeleton of the group 
of compounds investigated is divided into regions. Available informations about important molecular regions to 
determine biological activity is taken into account in the choice of the regions to the LDOS calculations. The 
present study we divided TC into 6 regions: 

1. ring A and radicals; 

2. ring B and radicals; 

3. ring C and radicals; 



(1) 
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4. ring D and radicals, exception to groups CONH2 and N~^H{Me)2; 

5. radical N+H{Me)2; 

6. radical COiVi?2. 

The group of 14 molecules has been divided into two subgroups accordingly to their biological activity. Our 
criterion to classify molecules as active or inactive was based on the experimental values of Ksens index presented 
in Table 1. Molecules with K^ens lower than 100 arc classified as inactive and those with K^ens greater than 100 as 
actives. The if sens index represents the inhibition velocity constant based on cell growing velocity law [27]. 

The importance of electronic descriptors was evaluated by comparing EIM results with those obtained with 
standard SAR methods, Principal Component Analysis (PC A) and Artificial Neural Networks (ANN). 

The PC A method is an extremelly useful explorative tool to separate samples into groups accordingly to their 
similarity and also to select important descriptors to these separations. In a geometrical interpretation each descriptor 
is considered a vector in a N-dimensional space and PCA method constructs a new space where new vectors, called 
principal components (PC), are linear combinations of the original descriptors. In the PCs space the samples 
are mapped through scores and the original descriptors through loading. Prom the loading plots, the important 
descriptors can be identified and the correlation pattern among them can be established. These calculations were 
carried out with the Einsight program [42] 

The ANN analysis was carried out using the program package Perceptron-typc Neural Network Simulator with 
back-propagation algorithm (PSDD) [43, 44]. PSDD consists of a thrcc-laycr pcrccptron network originally developed 
for drug design. This is a non-linear method and offers great advantage over conventional SAR methods for the 
cases where the target properties are not linearly dependent on the chosen parameters. In the ANN analysis we 
considered only the EIM descriptors to classify old and new proposed tetracyclines. 

3 Results and Discussions 
3.1 Structures 

For the geometry optimization a conformational search was carried out considering the systematic variation of the 

dihedral angles related to the radicals bounded to the molecular skeleton. We initialy used the rotational step of 10 
degrees to the OH, Me, NO2, NH2, CONH2, and NH(Me)2 groups in a rigid-rotor approach (one self consistent field 
calculation) using AMI and PM3 hamiltonians. The ten best conformations (in terms of energy) were then selected 
and their geometries were fully optimized. All calculations (AMI and PM3) have been performed with Mopac 6 
implemented in Chem2Pac program [35] using high quality numerical precision optimization criteria. 

An important characteristic of TC geometries is the presence of NH~^{Me)2 group bounded to carbon 4. For 
the most stable conformations we investigated in details the energy variation related to the rotation of this group. 
For this specific part of study we optimized the geometry of each conformation generated by a rotation of 5 degrees 
at the dihedral formed by the hydrogen and nitrogen atoms of NH^{Me)2 group, carbon 4, and hydrogen bound on 
carbon 4 (£*/iArH+(Me)2)- The complete rotation of 360° showed the existence of the £'/iArjy+(Me)2 two local minima 
of energy. The structure at Dh=-180 degrees is named extended and the other at Dh=30 degrees twisted [24, 25, 16]. 
In Figure 2 we present the result obtained to molecule 01, for all the other molecules the curve of energy is similar. 
The relative energy is calculated considering the lowest energy as zero of reference. As can be observed from Figure 
2 the lowest energy conformer corresponds to the dihedral £'/iAr_f/+(Me)2 around 180 degrees. From the inset in the 
Figure 2 we can observe that in this conformation the hidrogen of NH~^{Me)2 is directed toward the OH group at 
C12a. The second local minimum conformation is observed for -D/ijvi?+(Me)2 around 35 degrees and in this case the 
hydrogen atom points to 03 moiety. 
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Figure 2: Relative energy for the rotation of dihedral Dht^N {Ade)2) calculated with AMI method to molecule 
01. The -180 and 180 degrees correpond to the same conformation, the called extended conformation. The twisted 
conformation are located around 35 degrees. Structures in the inset show details of these conformations. 



In order to better illustrate these conformations we show in Figure 3 the lowest energy conformer for molecule 
01 in the extended conformation. 

In Table 2 we present the main dihedral angles characterizing the TC rings from the global minimum of each 
compound calculated with AMI. Similar results were obtained with PM3 method. 

As can be observed from the values of dihedrals 5-4a-12a-l and 3-4-4a-12a (Table 2) all the molecules have the A 
ring non co-planar with the plane formed by rings D, C, and B. The rings C and B present a small torsion induced 
by the hydrogen atom bounded to carbon 5a. Torsions angle values between D-C and C-B rings are shown in second 
and third columns, respectively. 




I- i , I : I I I : \ I I I l__J I . I I I I . «-- 

-ISO -150 -1 20 -90 -N) -?10 30 60 W 120 150 ISO 
Angle ( degree ) 



Figure 3: Molecule 01 optimized structure from AMI calculation. 



Table 2: Main latex article templatedihedral angles (in degrees) for optimized TC structures with AMI method. 
The second, third, and fourth columns show the planarities between rings D-C, C-B, and B-A, respectively. The 
fifth column indicates A ring angle torsions values. Labelling accordingly to Figure 1. 
Molecule 7-6a-10a-ll 6-5a-lla-12 5-4a-12a-l 3-4-4a-12a 



01 


-178.7 


-138.5 


-78.9 


60.6 


02 


-175.7 


-147.3 


-84.8 


60.1 


03 


-171.4 


-121.6 


-66.4 


22.1 


04 


-178.1 


-139.1 


-78.6 


62.4 


05 


-178.0 


-139.7 


-79.5 


62.4 


06 


-179.7 


-140.9 


-79.9 


60.5 


07 


-165.7 


-141.7 


-82.2 


62.7 


08 


-174.5 


-140.9 


-80.1 


60.7 


09 


-174.6 


-140.0 


-79.8 


61.0 


10 


-178.9 


-139.4 


-79.2 


62.4 


11 


-176.7 


-139.8 


-79.9 


61.0 


12 


174.1 


178.8 


-72.3 


59.8 


13 


-177.9 


-136.7 


-72.2 


54.3 


14 


-177.8 


-140.3 


-79.9 


60.9 



3.2 The EIM Method and PCA Analysis 

From the EIM analysis we obtained the A for pairs of frontier orbitals (Highest Occupied Molecular Orbital-HOMO, 
Lowest Unoccupied Molecular Orbital- LUMO and next closest levels) and the rj data for all the regions listed in the 
methodology section. We observed that the best results for both AMI and PM3 methods are obtained for AH (Eq. 
3) and rjH (Eq. 4) calculated over the region 6 (radical CONH2 bounded to carbon 2). In this case the equations 1 
and 2 became: 

^ = EjjOMO — EuOMO-1 (3) 



= 2 ^ {\Ca(^HOMO)? - \Ca{HOMO-l)?) (4) 

a — a\ 

The EIM parameter values are presented in Table 3. In order to build the EIM rules we determine the critical 
values of the descriptors AiJc and ryiJc- The classificatory rules are constructed trying to reproduce the experimental 
classes indicated in section 2 with relation to the Kg^ns values. For AMI (Eq.5) and PM3 (Eq. 6) methods the 
following the rules have been obtained: 

IF AH > 0.70 AND rjH > 1.39 THEN molecule is inactive (5) 

IF AH < 0.58 OR ijH > -0.54 THEN molecule is active (6) 

These crittical values (in equations 5 and 6) were obtained from an automatic screening search that determine 
the best statistical relevance of these numbers. 

With these two descriptors and EIM simple rules for AMI data it was possible to reproduce the experimental 
activity (active or inactive) with 100% of accurary. For PM3 calculation the rules achieved ~93% of accuracy, only 
molecule 11 was incorrectly classified. 
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Table 3: EIM descriptors calculated from both AMI and PM3 methods. Molecules are presented according to their 
Ksf,ns values [27]. 

AMI PM3 

AH 

0.64 -0.56 

0.93 -0.56 
Inactive 0.59 -0.63 



Molecule 


sens ^ 10 


AH 


r?H 


13 


8.5 


0.98 


1.42 


14 


70.0 


0.89 


1.39 


09 


76.0 


0.76 


1.39 



12 


130.0 


0.18 


1.26 


0.282 


1.23 


04 


145.0 


0.94 


-0.59 


0.465 


-0.72 


02 


190.0 


0.11 


-1.38 


0.194 


-1.28 


10 


250.0 


0.86 


1.33 


0.50 


-0.71 


11 


250.0 


0.39 


1.39 


0.599 


-0.63 


05 


350.0 


0.12 


1.33 


0.311 


1.29 


01 


600.0 


0.99 


1.38 


0.581 


-0.63 


03 


650.0 


1.00 


1.38 


0.893 


-0.53 


07 


950.0 


0.81 


1.33 


Active 0-575 
0.465 


-0.63 


06 


1100.0 


0.94 


1.34 


-0.71 


08 


1450.0 


1.18 


-0.26 


0.554 


-0.63 



The EIM results were then contrasted against the PCA methodology. For PCA analysis we considered a set of 
25 parameters where electronic, physicochemical and stereochemical ones were included: 

• Ail 

• -qH 

• AL 

• 77L 

• EnergyijoMO 

• EnergyijoMO-i 

• Energyi,(7MO 

• Energyi,[/MO-M 

• LDOS (values from eigenstates HOMO, HOMO-1, LUMO, and LUMO-l-1) 

• heat of formation 

• surface area 

• molecular volume 

• hydratation energy 

• Log P 
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Figure 4: PCA - Scores from AMI data. 
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• refractivity 

• polarizability 

• molecular mass 

• electronic energy 

• nuclear repulsion 

• dipolar momentum 

• ionization potential 

• hardness. 

Physicochemical variables frequently used in QSAR analysis have also been included in the calculations to 
evaluate the relevance of EIM descriptors. All the parameters were obtained from AMI and PM3 calculations and 
PCA calculations have been carried out separatedly for them. 

The data have been auto-scaled prior PCA analysis and we focused our attention on the two first principal 

components. The best results separating active and inactive compounds are presented in Figs. 4 and 5 to AMI and 

PM3 descriptors, respectively. 

Molecules in PCs planes are separated accordingly to the equations: 
AMI 

PCI = -0.36{AH) + Q.58{-nH) + 0.74{LogP) (7) 
PC2 = +0.81(A//) + 0.58{r)H) - Om{LogP) (8) 

PM3 

PCI = +0.0S{HOMO) + Om{AH) - 0A2{HF) + 0.62(r;_H") (9) 
PC2 = +0.S2{HOMO) + 0.04(A_ff) + 0.54(HF) + 0.21(j7_H") (10) 
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Figure 5: PCA - Score from PM3 data. 
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For AMI and PM3 the PCI vs PC2 plane accumulates a total variance of 87.26% and 80.71%, respectively. 

From Figures 4 and 5 and equations 7, 8, 9, and 10 we can see which variables are statistically more relevant 
among all the others in the set investigated. With exception of LogP all other variables are electronic and related 
to the EIM descriptors. Its is worth mentioning that during variables selection all variables were equally considered, 
no parameter scalling was used. In Fig. 4 a line delimiting the two classes of compounds was traced. For this case 
the molecules tend to agglomerate and the separation is not so clear. The results for PM3 data (Figure 5) show 
molecules more disperse and active and inactive groups separated. As already observed from EIM results, for the 
PM3 hamiltonian the molecule 11 was incorrectly classified as inactive. 

These EIM and PCA results can be used together as an aplicable and fast tool in the classification of new 
Tetracyclines. In this sense we propose a set of 90 new hypothetical TC to what experimental data is unknown. 
We then used the EIM rules from the available experimental data to classify (active/inactive) these new hypotetical 
structures. 



3.3 New Proposed Tetracyclines 

We considered the structures of molecules 01, 12, and 13 as our basic skeletons (T^, T^, and T^) to generate new 
hypothetical TC (Fig. 6, 7, and 8). Combining these skeletons with common TC substituents (Table 1), plus some 
7A column elements of the periodic table it was possible to generate 90 synthesis feasible new TC derivatives. Table 
4 presents the 30 combinations of the substituents considered for each skeleton. The new 90 compounds where 
labelled as 01 to 30, 31 to 60, and 61 to 90 for substituents of Table 4 in T^, and T^, respectively. 

These new compounds were then geometrically optimized following the same conformational searches described 
in section 2 and for the global minima we similarly obtained the EIM parameters values (Ai/ and r\li) from group 
CONH2 bound to carbon 2 (Table 5) accordingly to equations 1 and 2. For these new compounds we considered 
only AMI data, since the EIM results proved to be very similar using AMI or PM3 and level of accuracy for AMI 
was 100%. 

We use the EIM rules derived from AMI calculations (equation 5) to classify the new TC as active or inactive 
(table 5). As can be observed from Table 5, 25 compounds are classified as actives while the remaining 66 are 
classified as inactives. 
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Table 4: New proposed TC compounds. Labelling accordingly to Fig. 7 and 8. 







xvg 
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Rn 
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LL 
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V 
11 


LL 
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LL 
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XT 
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Rr 
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LL 


Rr 
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n 
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LL 


uo 
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LL 


n 




XT 
LL 
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\J\J 






R 
11 


R 
11 


R 
11 






11 


R 
11 


R 
11 
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n 


n 




11 


tr 

LL 


Ui7 




OR 
yJLL 


n 


Rr 
JDl 


R 


1 n 

±u 
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OR 
yJLL 


Mo 


Rr 


R 

LL 


1 1 

± i 


XT 
n 


OR 

KJLL 


n 


T 
1 


pr 

LL 


1 9 


XT 

n. 


OR 
KJLL 




T 
1 


LL 


1 


n 


OR 
KJLL 


n 


V 
r 


LL 


14 


TJ 
n 


OR 
KJLL 
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r 


pr 

XI 


1 1^ 


T 
1 


n 


n 


TJ 
n 


pr 

LL 


Ifi 


J 


JJ 


JJ 


JJ 


J 


17 


JJ 


JJ 


JJ 


JJ 


J 


18 


H 


H 


H 


I 


H 


1 Q 


TJ 

LL 


n 


T 
1 


u 

LL 


pr 

LL 


9n 

^u 


LL 


R 
11 


R 
11 


V 
L: 


R 
11 


21 


H 


OH 


Me 


NH2 


H 


22 


H 


OH 


Me 


NO2 


H 


23 


H 


OH 


Me 


H 


NH2 


24 


H 


OH 


Me 


H 


NO2 


25 


OH 


OH 


OH 


OH 


OH 


26 


OH 


H 


H 


H 


N(Me) 


27 


H 


OH 


Me 


H 


N(Me) 


28 


H 


H 


H 


OH 


N(Me) 


29 


OH 


H 


H 


H 


NH2 


30 


OH 


H 


H 


H 


NO2 



Figure 6: Structures of new proposed tetracyclines - 




OH O QH 
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Table 5: EIM descriptors calculated to the new proposed TC and the proposed biological qualitative activity (QA). 



I and A refer to the inactive and active, repectively. 





AH 




OA 




AH 


TjH 


OA 




AH 


TyrL 


OA 


KJi. 


U.C5Z 


i.oy 




Ol 


U.oU 


1 


39 




Dl 


n 84 


1 49 




09 


n so 
u.ou 


i.oy 




OZ 


n Qn 
u.yu 


1 


39 




R9 


n 8*^ 
u.oo 


1 42 




KJO 


U.C50 


1 '^Q 

I.oy 




'i'i 

oo 
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1 


39 




Do 


n 87 
u.o / 


1 49 




r\A 
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o4 


U.o 1 


1 
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1 49 




\JD 
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1 QQ 
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oo 


U.?S4 


1 
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oo 


n 8/1 

U.o4 


1 AO 




06 


0.82 


1.39 




36 


0.81 


1 


39 




66 


0.84 


1.42 




07 


0.76 


1.39 




37 


0.89 


1 


39 




67 


0.78 


1.42 




08 


0.96 


1.39 




38 


0.89 


1 


39 




68 


0.97 


1.42 




09 


1.13 


1.23 


A 


39 


1.04 


1 


39 




69 


1.00 


1.42 




10 


0.93 


1.39 




40 


0.91 


1 


39 




70 


0.94 


1.42 




11 


1.04 


1.38 


A 


41 


1.03 


1 


39 




71 


1.05 


1.41 




12 


0.97 


1.39 




42 


0.94 


1 


39 




72 


0.98 


1.42 




13 


0.91 


1.39 




43 


0.93 


1 


39 




73 


0.91 


1.42 




14 


0.84 


1.39 




44 


0.84 


1 


39 




74 


0.86 


1.42 




15 


0.82 


1.39 




45 


0.79 


1 


39 




75 


0.84 


1.42 




16 


0.86 


1.39 




46 


0.90 


1 


39 




76 


0.88 


1.42 




17 


0.91 


1.39 




47 


0.93 


1 


39 




77 


0.92 


1.42 




18 


0.94 


1.39 




48 


0.92 


1 


39 




78 


0.95 


1.42 




19 


0.90 


1.39 




49 


0.85 


1 


39 




79 


0.92 


1.42 




20 


0.78 


1.39 




50 


0.81 


1 


39 




80 


0.79 


1.42 




21 


0.05 


1.39 


A 


51 


0.16 


1 


39 


A 


81 


0.08 


1.42 


A 


22 


1.18 


-0.26 


A 


52 


1.02 


1 


39 




82 


1.23 


0.28 


A 


23 


0.26 


1.39 


A 


53 


0.43 


1 


39 


A 


83 


0.28 


1.43 


A 


24 


1.17 


-0.50 


A 


54 


1.19 


-0.55 


A 


84 


1.22 


-0.44 


A 


25 


0.68 


1.39 


I 


55 


0.83 


1.39 




85 


0.73 


1.43 


I 


26 


0.85 


1.39 


I 


56 


0.86 


1.39 




86 


0.85 


1.42 


I 


27 


0.03 


1.39 


A 


57 


0.19 


1.40 


A 


87 


0.25 


1.43 


A 


28 


0.13 


-1.39 


A 


58 


0.03 


1.39 


A 


88 


0.04 


1.42 


A 


29 


0.17 


1.39 


A 


59 


0.34 


1.39 


A 


89 


0.22 


1.43 


A 


30 


1.16 


-0.38 


A 


60 


1.17 


1.33 


A 


90 


1.22 


-0.19 


A 
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Figure 7: Structures of new proposed tetracyclines - Tp 
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The EIM and PCA methodologies consider the investigation of the correlation of theoretical descriptors with 
biological experimental data through linear combinations of variables. To verify the behavior of EIM descriptors 
(data from Table 3) in non linear correlations with experimental data we performed property-activity relationship 
searches using Artificial Neural Network methodology (ANN) [43, 44]. 

For ANN investigations we considered the 14 molecules studied in section III-A as the training group and the 
EIM parameters as network variables. In this trainning pattern we required the interactions precision of 10^^ and 
a 3 X 10^ of interactions. In Table 6 we present the parameters used in ANN calculations, where a is a nonlinearity 
parameter of sigmoid functions of neurons in the second and third neuron's layer, is a threshold neuron parameter 
in a second and third layers, and e is a constant weight mixing the weight matrix between recursive cicles. 

In Table 7 we present the results for the training with the 14 molecules. The input data was set as class 1 to 
active molecules (1.000 0.000 - pattern) and class 2 to inactive molecules (0.000 1.000 - pattern). The columns four 
and five present the output values of training compared to the 0.000 and 1.000 values of input. We can observe that 
output values are very close to input, indicating a good training of the network. This good learning was achieved 
with precision parameters in cycle 242,477. 

In Table 8 we present the results of the ANN classification, classes 1 and 2 correspond to active and inactive 
compounds, respectively. For comparision the EIM classification is also displayed. The results of EIM and ANN are 



Table 6: Parameters of ANN for training pattern. 
Layer Neurons a 9 e 
~ 3 

2 20 3.0 0.0 0.2 

3 2 3.0 0.0 0.2 
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Table 7: Input and output pattern of ANN training using EIM index of AMI calculations of table 3. 



No. Trainning Pattern Output Trainning Class 



01 


l.UUU (J. 000 


A AAr' A A A A 

0.996 0.004 


1 


02 


1 r\r\(\ A AAA 

1.000 0.000 


1 AAA A AAA 

1.000 0.000 


1 


03 


1 AAA A AAA 

1.000 0.000 


1 AAA A AAA 

1.000 0.000 


1 


04 


1.000 0.000 


1.000 0.000 


1 


05 


1.000 0.000 


1.000 0.000 


1 


06 


1.000 0.000 


1.000 0.000 


1 


07 


1.000 0.000 


0.997 0.003 


1 


08 


1.000 0.000 


1.000 0.000 


1 


10 


1.000 0.000 


1.000 0.001 


1 


11 


1.000 0.000 


1.000 0.000 


1 


12 


1.000 0.000 


1.000 0.000 


1 


09 


0.000 1.000 


0.000 1.000 


2 


13 


0.000 1.000 


0.005 0.995 


2 


14 


0.000 1.000 


0.003 0.997 


2 



in very good agreement. Only 6 compounds (11, 12, 39, 41, 52, and 71) out of ~90% (agreement) present conflictive 
classification. Interestingly all these 6 molecules arc classified as active by ANN and inactive ones by EIM, which 
seem to have more restrictive conditions for active classification. 

Finally, it is interesting to note the role played by substitucnts in the biological behavior of the new set of 
tetraciclines. According to our qualitative analysis the main substitution requirement for biological activity is the 
presence of NH2 or NO2 or dialkylamino groups at C7 and/or C9. Even for non-active derivative 13 (Table I) the 
presence of these substitucnts classifies as active new compounds 81-84 and 87-90 (Table VIII). Despite the whole 
molecular skeleton composed of 4 fused rings and the integrity of the ring A, the rest of the molecule including 
position C5 and C6 just plays minor role on the biological activity of tetracyclines. 

4 Summary and Conclusions 

In this work we applied the EIM methodology in the classification of activity of 14 derivatives of the Tetracicline 
and in the predictive classification of the activity of 90 hypothetical new derivatives. Our results showed that the 
EIM results are not method dependent indicating that the classification of activity of Tetraciclines is reproducible 
for different semiempirical methods. In comparison with conventional physicochemical descriptors the importance 
of EIM descriptors is reinforced by PCA and ANN methodologies. The rules constructed with EIM, PGA and ANN 
methods reproduced the experimental data with 100%, 96% and 100% of accuracy, respectivelly. For the new 90 
derivatives proposed EIM and ANN are in good agreement with a percentage of 93% consistent classification. These 
results add to increasing data of EIM studies indicating the pure quantum electronic indices can be reliably used to 
classify the biological activity of large number of different classes of materials. 

We hope the present study estimulate further experimental studies for tetracyclines in order to try to produce 
new antibiotic compounds. 
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Table 8: Results of the ANN prediction to the new TC compounds using only EIM parameters. Classes 1 and 2 



indicate active and inactives compounds respectively 







ANN 




EIM 






ANN 




EIM 






ANN 




EIM 


No. 


Predict 


Class 


Class 


No. 


Predict 


Class 


Class 


No. 


Predict 


Class 


Class 


1 


0.002 


0.998 


2 


2 


31 


0.001 


0.999 


2 


2 


61 


0.000 


1.000 


2 


2 


2 


0.001 


0.998 


2 


2 


32 


0.017 


0.983 


2 


2 


62 


0.000 


1.000 


2 


2 


3 


0.001 


0.999 


2 


2 


33 


0.015 


0.986 


2 


2 


63 


0.000 


1.000 


2 


2 


4 


0.044 


0.959 


2 


2 


34 


0.001 


0.999 


2 


2 


64 


0.000 


1.000 


2 


2 


5 


0.001 


0.999 


2 


2 


35 


0.000 


0.999 


2 


2 


65 


0.000 


1.000 


2 


2 


6 


0.002 


0.998 


2 


2 


36 


0.001 


0.999 


2 


2 


66 


0.000 


1.000 


2 


2 


7 


0.001 


0.999 


2 


2 


37 


0.007 


0.993 


2 


2 


67 


0.000 


1.000 


2 


2 


8 


0.383 


0.633 


2 


2 


38 


0.002 


0.998 


2 


2 


68 


0.000 


1.000 


2 


2 


9 


1.000 


0.000 


1 


1 


39 


1.000 


0.000 


1 


2 


69 


0.087 


0.911 


2 


2 


10 


0.066 


0.938 


2 


2 


40 


0.005 


0.995 


2 


2 


70 


0.000 


1.000 


2 


2 


11 


1.000 


0.000 


1 


1 


41 


1.000 


0.000 


1 


2 


71 


1.000 


0.000 


1 


2 


12 


0.924 


0.080 


1 


2 


42 


0.032 


0.970 


2 


2 


72 


0.002 


0.998 


2 


2 


13 


0.016 


0.984 


2 


2 


43 


0.021 


0.980 


2 


2 


73 


0.000 


1.000 


2 


2 


14 


0.001 


0.999 


2 


2 


44 


0.001 


0.999 


2 


2 


74 


0.000 


1.000 


2 


2 


15 


0.002 


0.998 


2 


2 


45 


0.001 


0.999 


2 


2 


75 


0.000 


1.000 


2 


2 


16 


0.005 


0.995 


2 


2 


46 


0.015 


0.985 


2 


2 


76 


0.000 


1.000 


2 


2 


17 


0.007 


0.993 


2 


2 


47 


0.015 


0.986 


2 


2 


77 


0.000 


1.000 


2 


2 


18 


0.138 


0.870 


2 


2 


48 


0.004 


0.996 


2 


2 


78 


0.000 


1.000 


2 


2 


19 


0.006 


0.994 


2 


2 


49 


0.001 


0.999 


2 


2 


79 


0.000 


1.000 


2 


2 


20 


0.000 


1.000 


2 


2 


50 


0.000 


1.000 


2 


2 


80 


0.000 


1.000 


2 


2 


21 


1.000 


0.000 


1 


1 


51 


1.000 


0.000 


1 


1 


81 


1.000 


0.000 


1 


1 


22 


1.000 


0.000 


1 


1 


52 


1.000 


0.000 


1 


2 


82 


1.000 


0.000 


1 


1 


23 


1.000 


0.000 


1 


1 


53 


1.000 


0.000 


1 


1 


83 


1.000 


0.000 


1 


1 


24 


1.000 


0.000 


1 


1 


54 


1.000 


0.000 


1 


1 


84 


1.000 


0.000 


1 


1 


25 


0.004 


0.995 


2 


2 


55 


0.001 


0.999 


2 


2 


85 


0.000 


1.000 


2 


2 


26 


0.002 


0.998 


2 


2 


56 


0.001 


0.999 


2 


2 


86 


0.000 


1.000 


2 


2 


27 


1.000 


0.000 


1 


1 


57 


1.000 


0.000 


1 


1 


87 


1.000 


0.000 


1 


1 


28 


1.000 


0.000 


1 


1 


58 


1.000 


0.000 


1 


1 


88 


1.000 


0.000 


1 


1 


29 


1.000 


0.000 


1 


1 


59 


1.000 


0.000 


1 


1 


89 


1.000 


0.000 


1 


1 


30 


1.000 


0.000 


1 


1 


60 


1.000 


0.000 


1 


1 


90 


1.000 


0.000 


1 


1 



14 



5 Acknowledgements 

This work has been supported by the Brazihan agency FAPESP and also thanks to CAPES, CNPq, FAPEMIG, and 
IMMP. 

References 

[I] S. B. Levy. S. B.; The Antibiotic Paradox: How Miracle Drugs are Destroying the Miracle. 1992. Plenum Press, 
New York. 

[2] Roberts, M. C; Tetracycline resistance determinants: Mechanisms of action, regulation of expression, genetic 
mobility, and distribution. FEMS Microbiology Reviews, 1996, 19, 1-24. 

[3] Humbert, P.; Treffel, P.; Chapuis, J. F.; Buchet, S.; Dcrancourt, C; Agache. P. The tetracyclines in dermatology. 
Journal of The American Academy of Dermatology, 1991, 25, 691-697. 

[4] Lertvorachon, J.; Kim, J. P.; Soldatov, D. V.; Boyd, J.; Roman, G.; Cho, S. J.; Popek, T.; Jung, Y. S.; Laua, P. 
C. K.; Konishia. Y. Bioorganic & Medicinal Chemistry, 2005, 13, 4627-4637. 1,12-Substituted tetracyclines as 
antioxidant agents. 

[5] Fife, R. S.; Sledge Jr., G. W.; Roth, B. J.; Proctor. C. Effects of doxycycline on human prostate cancer cells in 
vitro. Cancer Letters, 1998, 127, 37-41. 

[6] Eckhardt, S. G.; Hidalgo, M. Development of matrix metalloproteinase inhibitors in cancer therapy. Journal of 
the National Cancer Institute, 2001, 93, 178-193. 

[7] Levy, S. B.; McMurry, L. M.; Barbosa, T. M.; Burdett, V.; Courvahn, P.; Hillen, W.; Roberts, M. C.; Rood, J. 
I.; Taylor. D. E. Antimicrob. Agents Chemother, 1999, 43, 1523-1524, . 

[8] Roberts. M. C. FEMS Microbiology Reviews, 2005, 245, 195-203. Update on acquired tetracycline resistance 
genes. 

[9] G. Sengclov, B. Halling-Sorensen, and F. M. Aarestrup. Veterinary Microbiology, 95:91-101, 2003. Susceptibility 
of Escherichia coli and Enterococcus faecium isolated from pigs and broiler chickens to tetracycline degradation 
products and distribution of tetracycline resistance determinants in E. coli from food animals. 

[10] Levy, S. B.: Hedges, R. W.; Sullivan, F.;Medeiros, A. A.; Sosroseputro. H. Multiple antibiotic-resistance plasmids 
in cnterobactcriaccac isolated from diarrheal specimens of hospitalized children in Indonesia. J. Antimicrob. 
Chemother, 1985, 16, 7-16. 

[II] Jones, C. S.; Osborne, D. J.; Stanley. J. Enterobacterial tetracycline resistance in relation to plasmid incom- 
patibiUty. Mol. Cell Probes, 1992, 6, 313-317. P. M. V. B. 

[12] Chartone-Souza, E.; Loyola, T. L.; Bucclarelli-Rodriguez, W.; Menezes, M. A. D.; Rey, N. A.; Pereira-Mala. 
E. C. Synthesis and characterization of a tetracycline-platinum (ii) complex active against resistant bacteria. 
Journal of Inorganic Biochemistry, 2005, 99, 1001-1008. 

[13] Oliva, B.; Chopra. I. Tet determinants provide poor protection against some tetracyclines - further evidence for 
division of tetracyclines into 2 classes. Antimicrob. Agents Chemother, 1992, 36, 876-878. 



15 



[14] Hailing- Sorensen, B.; Sengelov, G.; Tornelund. J. Toxicity of tetracyclines and tetracycline degradation products 
to environmentally relevant bacteria, including selected tetracycline-resistant bacteria. Arch. Environ. Contam. 
Toxicol, 2002, 42, 263-271. 

[15] Cosentino, U.; Var, M. R.; Saracino, A. A. G.; Pitea, D.; Moro, G.; Salmona, M. Tetracycline and its analogues 
as inhibitors of amyloid fibrils: searching for ageometrical pharmacophore by theoretical investigation of their 
conformational behavior in aqueous solution. J. Mol. Model., 2005, 11, 17-25. 

[16] Othersen, O. G.; Beierlein, F.; Lanig, H.; Clark, T. Conformations and tautomers of tetracycline. J. Phys. 
Chem. B, 2003, 107, 13743-13749. 

[17] Lambs, L.; Decock-Le Reverend, B.; Kozlowski, H.; Bcrthon, G. Metal ion-tetracycline interactions in biological- 
fluids .9. circular-dichroism spectra of calcium and magnesium complexes with tetracycline, oxy tetracycline, 
doxycycline, and chlortetracycline and discussion of their binding modes. Inorg. Chem., 1988, 27, 3001-3012. 

[18] Lambs, L.; Berthon, G. Metal-ion tetracycline interactions in biological-fluids .7. quantitative investigation of 
methacycline complexes with ca(ii), mg(ii), cu(ii) and zn(ii) ions and assessment of their biological significance. 

Inorg. Chim.. Acta, 1988, 151, 33-43. 

[19] Brion, M.; Lambs, L.; Berthon, G. Metal iontetracycline interactions in biological-fluids .6. formation of cop- 
per(ii) complexes with tetracycline and some of its derivatives and appraisal of their biological signiflcance. Inorg. 
Chim. Acta, 1986, 123, 61-68. 

[20] Lambs, L.; Brion, M.; Berthon, G. Metal-ion tetracycline interactions in biological-fluids .4. potential influence 

of ca-2+ and mg-2+ ions on the bioavailability of chlortetracycline and demethylchlortetracycline, as expected 
from their computer-simulated distributions in blood-plasma. Inorg. Chim. Acta, 1985, 106, 151-158. 

[21] Bi, S. Y.; Song, D. Q.; Tian, Y.; Zhou, X.; Liu, X. Y.; Zhang, H. Q. Molecular spectroscopic study on the 
interaction of tetracyclines with serum albumins. Spectrochimica Acta Part A - Molecular and Biomolecular 
Spectroscopy, 2005, 61, 629-636. 

[22] Baptiste, D. C; Hartwick, A. T. E.; JoUimore, C. A. B; Baldridge, W. H.; Seigel, G. M.; Kelly, M. E. M. An 
investigation of the neuroprotective effects of tetracycline derivatives in experimental models of retinal cell death. 
Molecular Pharmacology, 2004, 66, 1113-1122. 

[23] Nicolas, L; Vilchis, M.; Aragon, N.; Miranda, R.; Hojer, G.; Castro, M. Theoretical study of the structure and 
antimicrobial activity of horminone. International Journal of Quantum Chemistry, 2003, 93, 411-421. 

[24] Zerner, M. C; dos Santos, H. F.; de Almeida, W. B. Conformational analysis of the anhydrotetracycline 
molecule: A toxic decomposition product of tetracycline. Journal of Pharmaceutical Sciences, 1988, 87, 190- 
195. 

[25] Belletato, P.; de Almeida, W. B.; dos Santos, H. F.; Nascimento, C. S. The conformational and tautomeric 
equilibrium of 5a,6-anhydrotetracycline in aqueous solution at ph 7. J. Mol. Structure- Theochem, 2003, 626, 
305-319. 

[26] de Miranda. C. F.; Costa, L. A. S.; de Almeida, W. B.; dos Santos, H. F.; Macial, B. L. Structure and 
properties of the 5a,6-anhydrotetracycline platinum(ii) dichloride complex: A theoretical ab initio study. Journal 
of Inorganic Biochemistry, 2006, 100(10), 1594-1605. 



16 



[27] Peradejordi, F.; Martin, A. N.; Cammarata, A. Quantum chemical approach to structure-activity relationships 
of tetracycline antibiotics, Journal of Pharmaceutical Science, 1971, 60(4), 576. 

[28] Dewar, M. J. S.; Zoebish, E.G.; Healy, E.F.; Stewart, J. J. P. The development and use of quant mn- mechanical 
molecular-models .76. AMI - A new general-purpose quantum-mechanical molecular-model. Journal American 
Chememical Society, 1985, 107, 3092-3909. 

[29] Stewart, J. J. P. Optimization of parameters for semiemprical methods .1. method, J. Comp. Chem, 1989, 
10(2), 209-220. 

[30] Stewart, J. J. P. Optimization of parameters for semiemprical methods .2. Applications, J. Comp. Chem., 1989, 
10(2), 221-264. MOPAC program, version 6.0 Quantum Chemistry Exchange Program No. 455. 

[31] Zerner, M.C. Reviews in Computational Chemistry, VCH, New Tork, 1991, 313-365. Semiempirical molecular 
orbital methods, in: K.B. Lipkowitz, D.B. Boyd (Eds.). 

[32] Paniago, E. B.; Tosi, L.; Bcraldo, H.: do Siqucira, .J. M.; CArvalho, S. Mctal-complcxcs of anhydrotetracycline 
.1. A spoctromctric study of the Cu(II) and Ni(II) Complexes, J. Pharm. ScL, 1994, 83, 291-295. 

[33] GarnierSuillerot, A.; Beraldo, H.; Machado, F. C; Demicheli. C. Metal-complexes of anhydrotetracycline .2. 
absorption and circular- dichroism study of Mg(ii), Al(iii), and Fe(iii) complexes - Possible influence of the Mg(ii) 
complex on the toxic side-eff'ects of Tetracycline, J. Inorg. Biochem., 1995, 60, 163-173. 

[34] Beraldo, H.; Matos, S. V. D. M. Mctal-complcxcs of anhydrotetracycline .3. An absorption and circular- 
dichroism study of the Ni(ii), Cu(ii) and Zn(ii) complexes in aqueous-solution, J. Braz. Chem. Soc, 1995, 
6(4), 405-411. 

[35] M. Cyrillo, M.; Galvao, D. S. Structure-activity relationship study of some inhibitors of HIV-1 integrase, J. 
Mol. Structure-Theochem, 1999, 464(1-3), 267-272. 

[36] Santo, L. L. D.; Galvao, D. S. Structure-activity study of indolequinones bioreductive alkylating agents, J. Mol. 
Structure-Theochem, 1999, 464(1-3), 273-279. 

[37] Baronc, P. M. V. B.; Camilo Jr, A.; Galvao, D. S. Theoretical approach to identify carcinogenic activity of 
polycyclic aromatic hydrocarbons, Phys. Rev. Lett., 1996, 77, 1186-1189. 

[38] Braga, R. S.; Vendrame, R.; Galvao, D. S. Structure-activity relationship studies of substituted 17 alpha- 
acetoxyprogesterone hormones, Journal of Chemical Information and Computer Sciences, 2000, 40(6), 1377- 
1385. 

[39] Braga, S. F.; Galvao, D. S. Benzo[c]quinolizin-3-ones theoretical investigation: SAR analysis and application to 
nontested compounds. Journal of Chemical Information and Computer Sciences, 2004, 44(6), 1987-1997. 

[40] Troche, K. S.; Braga, S. F.; Coluci, V. R.; Galvao, D. S. Carcinogenic classification of polycyclic aromatic 
hydrocarbons through theoretical descriptors. International Journal of Quantum Chemistry, 2005, 103(5), 718- 
730. 

[41] Braga, S. F.; Galvao, D. S. A structure - Activity study of taxol, taxotere, and derivatives using the electronic 
indices methodology (EIM), Journal of Chemical Information and Computer Sciences, 2003, 43(2), 699-706. 

[42] Einsight 3.0. 1991. Infometrix, Inc 2200. Sixth Avenue, Suite 833, Seatle WA, 98121. 



17 



[43] H. Ichikawa. 1991. PSDD: Perceptron-type Neural Network Simulator, QCPE 615, Indiana, USA. 

[44] Ayoama, T.; Suzuki, Y.; Ichikawa, H. Neural networks applied to pharmaceutical problems .1. method and 
application to decision-making, Chem. Pharm. Bull., 1989, 37(9), 2558-2560. 



18 



